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Estimating the selectivity of multidimensional range queries over real valued attributes 
has significant applications in data exploration and database query optimization. In this 
paper, we consider the following problem: given a table of d attributes whose domain is 
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Attributes of a relation are not typically independent. Multidimensional histograms can be 
an effective tool for accurate multiattribute query selectivity estimation. In this paper, we 
introduce STHoles, a "workload-aware" histogram that allows bucket nesting to capture 
data regions with reasonably uniform tuple density. STHoles histograms are built without 
examining the data sets, but rather by just analyzing query results. Buckets are allocated 
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Several studies have demonstrated the effectiveness of the wavelet decomposition as a 
tool for reducing large amounts of data down to compact wavelet synopses that can be 
used to obtain fast, accurate approximate query answers. Conventional wavelet synopses 
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that greedily minimize the overall root-mean-squared (i.e., L2-norm) error in the data 
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The graphics processor (GPU) on today ! s commodity video cards has evolved into an 
extremely powerful and flexible processor. The latest graphics architectures provide 
tremendous memory bandwidth and computational horsepower, with fully programmable 
vertex and pixel processing units that support vector operations up to full IEEE floating 
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Database management systems will continue to manage large data volumes. Thus, 
efficient algorithms for accessing and manipulating large sets and sequences will be 
required to provide acceptable performance. The advent of object-oriented and extensible 
database systems will not solve this problem. cOn the contrary/ modern^data models 
' exacerbate the problem: In order to manipulate large sets of complex objects as 
efficiently as today's database systems manipulate simple records, query-processi ... 
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Approximate query processing has emerged as a cost-effective approach for dealing with 
the huge data volumes and stringent response-time requirements of today's decision 
support systems (DSS). Most work in this area, however, has so far been limited in its 
query processing scope, typically focusing on specific forms of aggregate queries. 
Furthermore, conventional approaches based on sampling or histograms appear to be 
inherently limited when it comes to approximating the results of complex queri ... 
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Recent work has demonstrated the effectiveness of the wavelet decomposition in reducing 
large amounts of data to compact sets of wavelet coefficients (termed "wavelet 
synopses") that can be used to provide fast and reasonably accurate approximate query 
answers. A major shortcoming of these existing wavelet techniques is that the quality of 
the approximate answers they provide varies widely, even for identical queries on nearly 
identical values in distinct parts of the data. As a result, users ha ... 
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This article presents a novel framework for the symbolic bounds analysis of pointers, 
array indices, and accessed memory regions. Our framework formulates each analysis 
problem as a system of inequality constraints between symbolic bound polynomials. It 
then reduces the constraint system to a linear program. The solution to the linear 
program provides symbolic lower and upper bounds for the values of pointer and array 
index variables and for the regions of memory that each statement and procedur ... 
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Data sets in large applications are often too massive to fit completely inside the 
computers internal memory. The resulting input/output communication (or I/O) between 
fast internal memory and slower external memory (such as disks) can be a major 
performance bottleneck. In this article we survey the state of the art in the design and 
analysis of external memory (or EM) algorithms and data structures, where the goal is to 
exploit locality in order to reduce the I/O costs. We consider a varie ... 
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Approximating the joint data distribution of a multi-dimensional data set through a 
compact and accurate histogram synopsis is a fundamental problem arising in numerous 
practical scenarios, including query optimization and approximate query answering. 
Existing solutions either rely on simplistic independence assumptions or try to directly 
approximate the full joint data distribution over the complete set of attributes. 
Unfortunately, both approaches are doomed to fail for high-dimensional dat ... ! 
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Histograms and related synopsis structures are popular techniques for approximating data 
distributions. These have been successful in query optimization and a variety of 
applications, including approximate querying, similarity searching, and data mining, to 
name a few. Histograms were a few of the earliest synopsis structures proposed and 
continue to be used widely. The histogram construction problem is to construct the best 
histogram restricted to a space bound that reflects the data t distributi .... , 
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While much recent research has focussed on extending databases, beyond the traditional 
relational model, relatively little has been done to develop database tools for querying 
data organized in (multidimensional) arrays. The scientific computing community has 
made little use of available database technology. Instead, multidimensional scientific data 
is typically stored in local files conforming to various data exchange formats and queried 
via specialized access. libraries tied in to general purpo&...^t!un ursu a vu.it.. , 
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Embedded processors for video image recognition require to address both the cost (die 
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size and power) versus real-time performance issue, and also to achieve high flexibility 
due to the immense diversity of recognition targets, situations, and applications. This 
paper describes IMAP, a highly parallel SIMD linear processor and memory array 
architecture that addresses these trading-off requirements. By using parallel and systolic 
algorithmic techniques, despite of its simple architecture IMAP ach ... 
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Bitmap indices are efficient for answering queries on low-cardinality attributes. In this 
article, we present a new compression scheme called Word-Aligned Hybrid (WAH) code 
that makes compressed bitmap indices efficient even for high-cardinality attributes. We 
further prove that the new compressed bitmap index, like the best variants of the B-tree 
index, is optimal for one-dimensional range queries. More specifically, the time required 
to answer a one-dimensional range query is a linear f ... 
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Full text available: ^gpdf(17.07 MB) Additional Information: full citation , abstract , citings 

Level set methods, an important class of partial differential equation (PDE) methods, 
define dynamic surfaces implicitly as the level set (iso-surface) of a sampled, evolving nD 
function. The course begins with preparatory material that introduces the concept of using 
partial differential equations to solve problems in computer graphics, geometric modeling 
and computer vision. This will include the structure and behayiorof several different types 
of differential equations, e.g. the level set eq ... 
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Arrays are a common and important class of r data.°At"present, database systems do not 
provide adequate array support: arrays can neither be easily defined nor conveniently 
manipulated. Further, array manipulations are not optimized. This paper describes a 
language called the Array Manipulation Language (AM L), for expressing array 
manipulations, and a collection of optimization techniques for AMLexpressions.In the AML 
framework for array manipulation, arbitrai^.externally-defmedjunctions ... 

Keywords: Array manipulation language, Array query optimization, Declarative query 
language, Memory-usage optimization, Pipelined evaluation, User-defined functions 
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We present a survey of the state-of-the-art techniques used in performing data and 
memory-related optimizations in embedded systems. The optimizations are targeted 
directly or indirectly at the memory subsystem, and impact one or more out of three 
important cost metrics: area, performance, and power dissipation of the resulting 
implementation. We first examine architecture-independent optimizations in the form of 
code transoformations. We next cover a broad spectrum of optimizati ... 
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Current optical motion capture devices are capable of capturing motion at frequencies 
exceeding 1000Hz thereby generating gigabytes of motion data. In this paper wei propose 
a method to solve the problem of subsequence motion matching in large motion 
databases. Our method supports non-uniform time-scaling. We begin with a polar-angle 
representation of the motion that gives a continuous thread in a multi-dimensional space. 
We improve the performance of the matching process by generating a motion c ... 
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In this paper we perform an extensive theoretical and experimental study, on common 
synopsis construction algorithms, with emphasis on wavelet based techniques, that take 
under consideration query workload statistics. Our goal is to compare, "expensive" 
quadratic time algorithms with "cheap" near-linear time algorithms, particularly when the 
latter are not optimal and/or not workload-aware for the problem at hand. Further, we 
present the first known algorithm for constructing wavelet synopses for .;<.v- = 
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With continued advances in communication network technology and sensing technology, 
there is astounding growth in the amount of data produced and made available through 
cyberspace. Efficient and high-quality clustering of large datasets continues to be one of 
the most important problems in large-scale data analysis. A commonly used methodology 
for cluster analysis on large datasets is the three-phase framework of 
sampling/summarization, iterative cluster analysis, and disk-labeling. There are th ... 
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Considering the analogy between image segmentation and cluster analysis, the aim of this paper is to 
adapt statistical texture measures to describe the spatial distribution of multidimensional 
observations. The main idea is to consider the cluster cores as domains characterized by their specific 
textures in the data space. The distribution of the data points is first described as a multidimensional 
histogram defined on a multidimensional regular array of sampling points. In order to evaluate 
locally a multidimensional texture, a co-occurrence matrix is introduced, which characterizes the 
local distribution of the data points in the multidimensional data space. Several local texture features 
can be computed from this co-occurrence matrix, which accumulates spatial and statistical information 
on the data distribution in the neighborhoods of the sampling points. Texture features are selected 
according to their ability to discriminate different distributions of data points. The sampling points 
where the local underlying texture is evaluated are categorized into different texture classes. The 
points assigned to these classes tend to form connected components in the data space, which are 
considered as the cores of the clusters. (All rights reserved Elsevier). 
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Abstract 

This paper presents Sassy, a single-assignment variant of the C programming language developed in 
concert with Khoral Incorporated, and designed to exploit both coarse-grain and fine-grain parallelism 
in image processing applications. Sassy programs are written in the Khoros software development 
environment, and can be manipulated inside Cantata (the Khoros GUI). The Sassy language supports 
image processing with true multidimensional arrays, sophisticated array access and windowing 
mechanisms, and built-in reduction operators (e.g., histogram). At the same time, Sassy restricts C 
so as to enable compiler optimizations for parallel execution environments, with the goal of reducing 
data traffic, code size and execution time. In particular, the Sassy language and its optimizing compiler 
target reconfigurable systems, which are fine-grain parallel processors. Reconfigurable systems consist 
of field-programmable gate arrays (FPGA), memories and interconnection hardware, and can be used 
as inexpensive co-processors with conventional workstations or PC. The compiler optimizations needed 
to generate highly optimal host, FPGA, and communication code, are discussed. The massive 
parallelism and high throughput of reconfigurable systems makes them well-suited to image processing 
tasks, but they have not previously been used in this context because they are typically programmed in 
hardware description languages such as VHDL. Sassy was developed as part of the Cameron project, 
with the goal of elevating the programming level for reconfigurable systems from hardware circuits to 
programming language. 
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The posterior capsule opacification images considered are images of the membrane encapsulating an 
artificial lens implanted during cataract surgery in place of the natural lens. The images are taken to 
monitor the state of the patient's vision after the surgery. Subsequent to the surgery, the membrane of 
the posterior capsule may become opacified, thus degrading the patient's vision. The authors discuss 
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